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ABSTRACT 

We use the GALFORM semi-analytical model to study high density regions traced by radio 
galaxies and quasars at high redshifts. We explore the impact that baryonic physics has upon 
the properties of galaxies in these environments. Star-forming emission-line galaxies (Lyor and 
Ha emitters) are used to probe the environments at high redshifts. Radio galaxies are predicted 
to be hosted by more massive haloes than quasars, and this is imprinted on the amplitude of 
galaxy overdensities and cross-correlation functions. We find that Lya radiative transfer and 
AGN feedback indirectly affect the clustering on small scales and also the stellar masses, star- 
formation rates and gas metallicities of galaxies in dense environments. We also investigate the 
relation between protoclusters associated with radio galaxies and quasars, and their present- 
day cluster descendants. The progenitors of massive clusters associated with radio galaxies 
and quasars allow us to determine an average protocluster size in a simple way. Overdensities 
within the protoclusters are found to correlate with the halo descendant masses. We present 
scaling relations that can be applied to observational data. By computing projection effects 
due to the wavelength resolution of modern spectrographs and narrow-band filters we show 
that the former have enough spectral resolution to map the structure of protoclusters, whereas 
the latter can be used to measure the clustering around radio galaxies and quasars over larger 
scales to determine the mass of dark matter haloes hosting them. 
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1 INTRODUCTION 

A fundamental ingredient of our understanding of galaxy forma¬ 
tion and evolution is the cosmological growth of structure in the 
Universe. The hierarchical growth arises from the non-linear evol¬ 
ution of the dark matter (DM) density field under the action of grav¬ 
ity (for a review, see Springel et al. 2006). In this scenario, galaxies 
form in gravitational potential wells (i.e. dark matter haloes) that al¬ 
low gas to cool and form stars, populating the cosmic web (White & 
Rees 1978). As a result, galaxies form and evolve in a diverse range 
of environments, from voids (or regions with densities well below 
the average), to field galaxies residing in average environments and 
to highly overdense regions. 

Observational studies of galaxy properties in different envir¬ 
onments have shown unequivocally that their properties are some¬ 
how connected with the environment in which they reside (e.g. 
Oemler 1974; Dressier 1980; Hashimoto et al. 1998; Kauffmann 
et al. 2004). Galaxy clusters, the most massive virialised structures 
in the Universe, have galaxy number densities of up to a few hun- 
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dred times higher than the average. In such high density environ¬ 
ments galaxies follow a well known morphology-density relation 
(e.g. Dressier 1980; Balogh et al. 1997; Goto et al. 2003). This in¬ 
dicates that red, early-type galaxies are more abundant than blue, 
late-type galaxies in these environments. 

Different baryonic processes are thought to be responsible for 
the transformation of galaxies in these environments, such as ram- 
pressure stripping and tidal interactions (Moore et al. 1996; Kawata 
& Mulchaey 2008; van den Bosch et al. 2008; Tecce et al. 2010). 
More recently, the quenching of star-formation due to AGN feed¬ 
back has been proposed as a key mechanism regulating the star- 
formation of galaxies hosted by massive haloes (e.g. Croton et al. 
2006; Bower et al. 2006; Lagos et al. 2008). 

From a cosmological perspective, these large structures we see 
in the local Universe are expected to grow from the aggregation of 
smaller haloes foimed at earlier epochs (Press & Schechter 1974; 
Peebles & Shaviv 1982; Davis et al. 1985; Lacey & Cole 1993). The 
hierarchical structure formation scenario implies that massive dark 
matter haloes are likely to be embedded in overdense regions. This 
suggests that any galaxy property related to halo mass will also be 
correlated with environment on scales beyond its host halo, i.e. in 
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super-halo scales (e.g. Mo et al. 2004). This is particularly import¬ 
ant from the perspective of the formation of galaxy clusters, i.e. 
in the protoclusters regime (Overzier et al. 2006). These massive 
structures at high redshifts will eventually form a bound cluster at 
late times, and thus galaxies are expected to have properties that are 
connected to their host halo and its surroundings. 

The search for protoclusters at high redshifts can thus be a key 
to unveiling the role of hierarchical merging and baryonic processes 
in galaxy formation and evolution. Observationally, detecting these 
structures has proven challenging. Common techniques used to de¬ 
tect nearby clusters, such as looking for the thermal X-ray emission 
from the intra-cluster medium (ICM) are not sensitive enough at 
z > 2, where clusters are expected to be in the early stages of form¬ 
ation. Searches for galaxy overdensities in wide-held surveys are 
more effective, but unless the galaxy redshifts are conhrmed spec¬ 
troscopically, the observed overdensities are likely to be affected 
by redshift uncertainties that are larger than the physical size of the 
protocluster itself (Chiang et al. 2013). In addition, projection ef¬ 
fects can smear out the real overdensity signal, or even enhance it 
depending on the viewing angle (Shallow et al. 2013). This makes 
the identihcation of overdense structures at high redshifts contro¬ 
versial. 

Instead of looking for overdensities in a blind survey, it is also 
possible to map the environments around objects that are good can¬ 
didates to lie in a density peak. Generally speaking, the most lumin¬ 
ous galaxies are found to be highly clustered, meaning that these 
are hosted by massive haloes (e.g. Norberg et al. 2001). High red- 
shift quasars are among the brightest objects in the Universe, so 
it is commonly assumed that these trace the most massive struc¬ 
tures as well (Steidel et al. 2005; Kashikawa et al. 2007; Overzier 
et al. 2009; Utsumi et al. 2010; Banados et al. 2013; Husband et al. 
2013). Other luminous objects used to find overdensities are the 
so-called Lya-blobs, which can typically reach luminosities above 
10“*^[erg s“' over extended regions of hundreds of kpc (Erb 
et al. 2011; Matsuda et al. 2011; Uchimoto et al. 2012). A third 
candidate for tracers of massive structures are radio galaxies, which 
can also have typical stellar masses of the order of 10'^[Mo/h] (Mi- 
ley & De Breuck 2008; Kauffmann et al. 2008; Donoso et al. 2010; 
Falder et al. 2010; Ramos Almeida et al. 2013; Karouzos et al. 
2014; Hatch et al. 2014). Fanidakis et al. (2013) showed that, in 
the context of a hierarchical galaxy formation model, high redshift 
quasars are not hosted by the most massive haloes, nor are they 
the progenitors of the most massive clusters that we observe today. 
Instead, the model predicts that luminous radio galaxies reside in 
very massive haloes, and are thus better tracers of the most massive 
protoclusters. In this paper we make use of the same galaxy form¬ 
ation model described in Fanidakis et al. (2013) to characterise the 
environments around these two types of active galaxies. 

In order to map the environment around a protocluster can¬ 
didate the redshifts of the objects must be known with sufficient 
accuracy. Typically, this is achieved with spectroscopic follow-up 
of a sample of photometric candidates, or by utilising a narrow- 
band filter chosen to look for a specific emission line at the redshift 
of the protocluster. Using Lyo' emitters to map the environment in a 
protocluster, Venemans et al. (2007) found that radio galaxies seem 
to pinpoint structures with masses ~ over the red¬ 

shift range 2 < z < 5. Likewise, Saito et al. (2014) determine an 
overdensity of Lya emitters around a radio galaxy at z ~ 4 that is 
only marginally reproduced in the galaxy formation model of Orsi 
et al. (2008). 

Here, we explore the properties and evolution of overdense 
regions by focusing specifically on those traced by radio galaxies 


and quasars in a theoretical framework. Radio galaxies are expected 
to trace haloes that are subject to star-formation quenching due to 
AGN feedback in massive haloes. Quasars, on the other hand, are 
characterised by a rapid accretion of cold gas triggered by mergers 
or disk instabilities, which places them in a broader range of halo 
masses. The most luminous quasars also experience star-formation 
quenching due to AGN feedback (Fanidakis et al. 2013). 

In this paper we tackle two main problems related to these two 
types of active galaxies and their environments. First, we character¬ 
ise the impact of baryonic processes on samples of emission-line 
galaxies populating overdense regions at high redshifts traced by 
radio galaxies and quasars. Second, by identifying the haloes that 
host radio galaxies and quasars we derive a simple way to define 
the size of protoclusters, and link their overdensities to the mass of 
the descendant haloes at z = 0. 

The backbone of this study is the GALFORM semi-analytical 
model (Cole et al. 2000; Baugh et al. 2005; Bower et al. 2006; La¬ 
gos et al. 2011). This model incorporates state-of-the-art prescrip¬ 
tions for the evolution of galaxies and their central supermassive 
black holes (BH, Fanidakis et al. 2011, 2013), and at the same 
time uses a Monte Carlo radiative transfer code for Lyor photons to 
derive physically robust Lya luminosities for high redshift galax¬ 
ies (Orsi et al. 2012). GALFORM predicts observational properties 
of galaxies, attempting to include all relevant physical mechanisms 
in the galaxy formation and evolution process. Hence, our choice 
of specific tracers of high density regions at high redshifts, and 
the galaxy populations used to measure the environment around 
them, are all predictions that our model provides within a robust 
framework, making it suitable to be confronted against observa¬ 
tional measurements. 

The structure of this paper is as follows. Section 2 describes 
the galaxy formation model used; Section 3 describes our results 
analysing both overdense regions and defining protoclusters traced 
by radio galaxies and quasars. Finally, Section 4 summarises our 
main findings and discusses their implications. 


2 THE THEORETICAL GALAXY EORMATION MODEL 

This section describes the semi-analytical model of galaxy form¬ 
ation used, its main features, and the modelling of active galaxies 
and emission-lines in star-forming galaxies. 

We make use of the GALFORM semi-analytical model of galaxy 
formation to predict the properties of galaxies as a function of red¬ 
shift. This model is described in detail elsewhere (Cole et al. 2000; 
Benson et al. 2003; Baugh et al. 2005; Bower et al. 2006; Lagos 
et al. 2011). The variant of GALFORM used here is presented in 
Lacey et al. (2015). 

In short, GALFORM computes the formation and evolution of 
galaxies in the context of the hierarchical growth of DM structures. 
The properties and merging histories of DM haloes are extracted 
from the Millennium-WMAP7 dark-matter only N-body simula¬ 
tion. The halo mass resolution is ~ lO'^iMoA^*] and its box-side 
length is 500[Mpc A“']. Hence, this simulation is similar to the 
well known Millennium run (Springel et al. 2005), except that it 
was run with a set of updated cosmological parameters taken from 
Komatsu et al. (2011) obtained using the WMAP-7 dataset, i.e. 
a,, = 0.0455, Dm = 0.272, Da = 0.728, n, = 0.967,0-8 = 0.810 
and h = 0.704. 

The main baryonic processes that enter in the GALFORM cal¬ 
culation are i) the shock-heating and radiative cooling of gas in¬ 
side haloes leading to the formation of a disk, ii) quiescent star- 
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formation in the disk, and starbursts in a galactic bulge following 
galaxy mergers and disk instabilities, iii) feedback due to super¬ 
novae, AGN and photoionisation which regulate the star-formation 
process, and iv) the chemical enrichment of the gas and stellar com¬ 
ponent. Galaxy luminosities are computed using a population syn¬ 
thesis model (Cole et al. 2000; Gonzalez-Perez et al. 2014). Dust 
extinction for the stellar continuum is calculated self-consistently 
based on the radiative-transfer model described in Lacey et al. 
( 2011 ). 

The variant of GALFORM used here incorporates features from 
different versions of the model that were used to study specific 
problems into a single model to provide a powerful galaxy form¬ 
ation tool. Most notably, the model invokes a different initial mass 
function (IMF) for quiescent and starburst events. A top-heavy IMF 
is used in the latter case to explain the abundance of high redshift 
galaxies detected in the sub-mm (Baugh et al. 2005). The model 
also includes a treatment of star-formation in disks following the 
atomic and molecular hydrogen content of the gas (Lagos et al. 
2011), and stellar luminosities using the Maraston (2005) stellar 
population synthesis model that incorporates the contribution from 
thermally-pulsating asymptotic giant branch (TP-AGB) stars (see 
also Gonzalez-Perez et al. 2014). 

2.1 Modelling emission-line galaxies 

In order to obtain the line fluxes of several hydrogen recombina¬ 
tion lines, GALFORM computes the total production rate of hydrogen 
ionizing photons (Lyman continuum photons) by integrating the 
composite spectral energy distribution (SED) of each galaxy over 
the extreme-UV continuum down to the Lyman break at T = 912A. 
Then, by assuming that all of these ionising photons are absorbed 
within the interstellar medium (ISM) of the galaxy (i.e. the escape 
fraction of ionising photons is set to zero), case B recombination is 
used to convert a fraction of the Lyman continuum photons into dif¬ 
ferent line fluxes (Osterbrock 1989; Dijkstra 2014). This intrinsic 
luminosity is later adjusted for the effect of dust attenuation by 
computing the continuum extinction at the wavelength of the line. 

We apply the procedure outlined above to obtain the Hfr lu¬ 
minosities of galaxies. In the case of Lya the intrinsic luminosity 
of Lyo- photons is expected to be reduced by the scattering of Lyor 
photons by neutral hydrogen atoms in the ISM and their absorp¬ 
tion by dust grains. The high scattering cross-section of photons at 
the Lya line centre makes these photons undergo numerous scat¬ 
tering events with hydrogen atoms, resulting in large path lengths, 
and thus they are likely to be absorbed by dust grains present in the 
ISM. This results in a complex radiative transfer problem that can¬ 
not be accurately accounted for using analytical expressions, except 
for a few idealised configurations (Harrington 1973; Neufeld 1990; 
Dijkstra et al. 2006). 

Instead, we compute the escape of Lya photons using a Monte 
Carlo radiative transfer code for Lya photons that allows us to ob¬ 
tain a value for the Lya escape fraction, fesc, which leads to the 
observed Lya luminosity. A full description of the radiative trans¬ 
fer code and its implementation in GALFORM can be found in Orsi 
et al. (2012). In short, the code follows the scattering, absorption 
and escape histories of a large number of photons (in this case, of 
the order of 10^ to lO'^ for each galaxy). Each photon can change 
its direction and frequency after interacting with hydrogen atoms 
and dust grains. If the former interaction occurs, then the photon 
is scattered, changing its direction, which leads to a change of fre¬ 
quency. If the latter takes place, then the photon can be scattered or 
absorbed, depending on the dust albedo. If a photon is absorbed by 


a dust grain, then it is discarded. If a photon escapes from the me¬ 
dium, its final frequency is recorded. The process is repeated until 
an accurate value for the escape fraction of Lya photons is attained. 
Eollowing Orsi et al. (2012), we restrict the total number of photons 
for a given run to be at least 10^, and up to 5 x 10“* when no photon 
was absorbed. Hence, the minimum escape fraction our model can 
compute is /esc = 2 x 10“^. 

There is much observational evidence for the presence of out¬ 
flows in Lya emitters at high redshifts (e.g. Giavalisco et al. 1996; 
Thuan & Izotov 1997; Kunth et al. 1998; Mas-Hesse et al. 2003; 
Shapley et al. 2003; Kashikawa et al. 2006; Hu et al. 2010; Komei 
et al. 2010). Hence, Orsi et al. (2012) assumed simple outflow geo¬ 
metries for the Lya photons to escape. Eor simplicity, two iso¬ 
thermal, spherically-symmetric models of galactic-scale outflows 
were adopted: an expanding thin shell and an expanding wind. Both 
are similar and their properties, such as expansion velocity, size and 
metallicity are directly proportional to the galaxy’s predicted cold 
gas mass, circular velocity, half-mass radius and cold gas metal¬ 
licity, respectively. In addition, the wind geometry displays a gas 
density profile of the form p{r) oc where the normalisation 
depends on the mass-ejection rate predicted by the supernova feed¬ 
back implemented in GALFORM. Eor simplicity, we will hereafter 
use the thin shell geometry. We have checked that our predictions 
are not sensitive to the choice of the outflow geometry. Since the 
Orsi et al. (2012) model was developed using an earlier version of 
GALFORM (described in Baugh et al. 2005), we re-calibrated the free 
parameters controlling the relation between the half-mass radii of 
galaxies and the inner radius of the outflows for each galaxy. The 
new parameter values are found by matching the luminosity func¬ 
tion of Lya emitters in the redshift range 0.2 < z < 6.6 (for details 
of the fitting procedure, see Orsi et al. 2012) For starbursts, this 
relation is described by 

7^m„er = *(l+t:)"<f^l/2>, (D 

where k = 0.2, y - I and (Rip) is the average of the half-mass radii 
of the disk and bulge components of the galaxies, weighted by their 
intrinsic Lya luminosity. For quiescent galaxies, /?i„ner = '^(Ri/i)- 

2.2 Modelling of radio galaxies and quasars 

To model radio galaxies we use the AGN prescriptions described 
in Fanidakis et al. (2011). The Fanidakis et al. model follows the 
mass accretion rate onto the BHs and the evolution of the BH mass, 
Mbh, and spin, a, allowing the calculation of a variety of predic¬ 
tions related to the nature of AGN. In this model the evolution of 
BHs and their host galaxies is fully coupled: BHs grow during the 
different stages of the evolution of the host by accreting cold gas 
(merger/disk-instability driven accretion: starburst mode) and hot 
gas (dilTuse halo cooling driven accretion: hot-halo mode) and by 
merging with other BHs. This builds up the mass and spin of the 
BH, and the resulting accretion power regulates the gas cooling and 
subsequent star formation in the galaxy. The resulting mass of the 
BH correlates with the mass of the galaxy bulge in agreement with 
the observations, (see Fanidakis et al. 2011, 2012) 

The BH spin distribution depends strongly on how the gas in 
a given accretion episode accretes onto the BH. Fanidakis et al. 
(2011) assume that the accretion flow fragments due to self grav¬ 
ity into multiple accretion episodes (chaotic accretion; King 2005; 
King et al. 2008). In this case, star formation in the vicinity of the 
BH can randomise the angular momentum of the gas, resulting in a 
succession of randomly aligned accretion disks around the BH. The 
end effect of this process is typically a BH with a low spin. High 
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Figure 1. The predicted spatial distribution of Lyu emitters around typical radio galaxies (top) and quasars (bottom) at redshifts 4.5,3.0 and 2.2 (from left to 
right). Each box shows a slice of 20[Mpc/h] on a side and 10[Mpc/h] depth. The DM density distribution is displayed in grayscale. Darker pixels indicate 
a higher density of DM. Green circles show the position of the radio galaxies. Orange circles show the position of the quasars. Blue circles show the spatial 
distribution of Lycr emitters with Lpyo. > 10‘**[erg 


spin values occur only for the most massive BHs (Mbh > IO^Mq), 
because the growth of these BHs is dominated by gas-poor BH-BH 
mergers which always result in fairly rapid spins of a ~ 0.7 - 0.9. 
Thus, in the chaotic accretion scenario there is a clear correlation 
of spin with BH mass and hence with host galaxy bulge mass. 
Massive BHs form in the most massive DM halos, they are hos¬ 
ted by massive elliptical galaxies and have rapid spins, while lower 
mass BHs form in spiral galaxies and have much lower spins. 

The gas accreted during a starburst episode is converted into 
an accretion rate M. The bolometric luminosity of the accretion 
flow associated, Lpoi, will depend on the accretion rate in Eddington 
units m = MIM-^a- If hi > 0.01, then the thin disk solution of 
Shakura & Sunyaev (1973) is used 

Lboi = eMc^, (2) 

where c is the speed of light, and e is an adjustable parameter. Oth¬ 
erwise, the ADAF thick disk solution is adopted (Narayan & Yi 
1994) 

T-boi.ADAF = 0.44 (3) 

Finally, if the accretion is super-Eddington (Lboi > then the 

bolometric luminosity is obtained as (Shakura & Sunyaev 1973) 

Tboi(> lyT^Edd) = ' 7[1 + ln(hi/ 77 )]LEdd, ( 4 ) 

where rj and is an adjustable parameter of the model. 


The mass, spin and mass accretion rate evolution are then 
coupled to the classic Blandford-Znajek jet model (Blandford & 
Znajek 1977). The jet power couples strongly to the accretion 
mode, as it most likely depends on the vertical (poloidal) magnetic 
field component Sp close to the BH horizon, Bjet ~ The 

accretion flow is assumed to form a geometrically thin disk for re¬ 
latively high accretion rates (Shakura & Sunyaev 1973), switching 
at lower accretion rates to a geometrically thick disk in an advec- 
tion dominated accretion flow (ADAF; Narayan & Yi 1994). The 
expression for the mechanical jet energy in each regime is then 
(Meier 2002); 

Bjet,ADAF = 2x lO'^^Mg^^^ja^ergs^*, (5) 

PjehTD = 2.5 X lO'^M' * ( —j fl^ergs-*, (6) 

where Mg is the mass of the BH in units of IO^Mq, a is the spin of 
the BH. The collapse by two orders of magnitude in scale height of 
the flow during the transition from an ADAF to a thin disk results in 
a similar drop in radio power. This already gives a dichotomy in ra¬ 
dio properties which may explain some of the distinction between 
radio-loud and radio quiet objects. Using these prescriptions for 
the accretion disk and jet, we calculate the optical and radio out¬ 
put from accreting BHs. The model fits the luminosity function 
of radio-loud AGN remarkably well when low mass objects have 
lower jet powers than high mass objects. This is achieved because 
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Figure 2. The halo mass function of radio galaxies (blue) and quasars (red) 
in our model. The grey line corresponds to the halo mass function of the 
overall galaxy population. Panels correspond to redshifts ^ = 4.5,3.0 and 
z = 2.2 as labelled. 
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Figure 3. The overdensity of Lycr emitters, (5^, around radio galaxies and 
quasars at redshift z = 3, as a function of the distance from the central 
object. The solid curve and the shaded regions show the median and 10-90 
percentile of the overdensity distribution for a sample of Lya emitters with 
iLya > 10""[ergs“' 


the jet couples strongly to the BH spin and the lower mass BHs 
have lower spin than the most massive BHs. Overall, the model 
predictions for the AGN population can reproduce the diversity of 
nuclear activity seen in the local and high-z Universe (Fanidakis 
et al. 2011, 2012). The predictions for the radio-optical luminosit¬ 
ies of FR-I, BLRG, Seyfert and LINER are in good agreement with 
the observed galaxy populations. 

Based on the disk and jet luminosity the model calculates for 
every accreting BH, we define as a quasar any galaxy whose central 
BH produces a disk luminosity higher than erg s“*. In contrast, 
a galaxy is defined as radio galaxy when its jet luminosity exceeds 
Fi.4GHz = 10^3 WHz-‘. 


3 RESULTS 

We now explore the predictions of GALFORM in the redshift range 
2 < z < 6. This is the redshift interval where the bulk of the ob¬ 
servational work about the environments around radio galaxies and 
quasars using Lya emitters has been carried out. Compiling statist¬ 
ical samples of Lya emitters is challenging at low redshifts (z < 2), 
and has only been possible with the GALEX satellite (e.g. Dehar- 
veng et al. 2008; Cowie et al. 2010; Wold et al. 2014). At z > 6, our 
model predicts that radio galaxies are very rare, such that it is not 
possible to robustly characterise the population. 

Throughout this paper, we measure overdensities around radio 
galaxies and quasars in redshift space. This means that the peculiar 
velocity of galaxies in the radial direction contributes to the ob¬ 
served redshift of galaxies, producing a distortion of their derived 
comoving distances. To introduce this effect into our model, we 
take the comoving z-coordinate, to represent the line-of-sight 
direction and replace it by its value in redshift space s\ 


where is the peculiar velocity of the galaxy along the line-of- 


sight, a is the expansion factor and H(z) the Hubble parameter eval¬ 
uated at redshift z. 

Most of our predictions focus on four redshifts z = 2.2,3.0,4.5 
and 5.7. These are the redshifts at which Lya has been detected 
from the ground with negligible atmospheric contamination. In ad¬ 
dition, z = 2.2 is particularly important since it is also the redshift 
at which a ground-based near-infrared (NIR) instrument can typic¬ 
ally search for Ha emitters (e.g. Geach et al. 2008; Koyama et al. 
2013), although Ha searches have extended up to z ~ 2.5 (Cooke 
et al. 2014) 

Fig. 1 shows the environment around selected radio galax¬ 
ies and quasars within the redshift range discussed above, as pre¬ 
dicted by GALFORM. Each image displays a cuboid of 20 x 20 x 
10[Mpc^/!“^], which illustrates the complicated filamentary struc¬ 
ture of the DM, shown in grey. Lya emitters with luminosities 
log(Li^„[erg s“' h^^J) > 41 are shown in blue. 

Radio galaxies and quasars trace different environments as 
a consequence of their different triggering mechanisms. Quasar 
activity is triggered in gas-rich galaxies, typically associated with 
haloes of mass lO'^fMg/A] (Fanidakis et al. 2013). For radio galax¬ 
ies, we find that the brightest objects live in the most massive ha¬ 
loes because this is where the spin and mass of BHs are higher, but 
also the accretion rate is low enough to form an ADAF which gives 
powerful jets, as discussed in the previous section. 

Fig. 2 shows the mass function of haloes that host radio galax¬ 
ies and quasars at different redshifts. Both types of AGN can be 
found in massive haloes. However, they represent a small subset of 
the population of massive haloes at any redshift. Furthermore, the 
fraction of haloes hosting a radio galaxy or quasar at a given halo 
mass is predicted to increase with redshift: At z = 2.2, radio galax¬ 
ies and quasars account for only 4.5 per cent of the haloes with 
mass above > 10*^[Mq//i]. At z = 3.0 this fraction increases 
to 5.2 per cent, and at z = 4.5 to 12 per cent. 

Quasars are significantly more abundant than radio galaxies at 
all redshifts and halo masses, and they also span a larger range of 
halo mass, peaking at Mhaio ~ 10**^ - lO'^fMo/A]. Radio galax- 
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z=4.5 z=3.0 z = 2.2 



Figure 4. The clustering and properties of Lycr emitters around radio galax¬ 
ies (blue) and quasars (red) as a function of distance from the central object 
for redshifts z = 4.5,3.0 and z = 2.2. Solid lines coirespond to predictions 
for the faint sample of Lyor emitters, whereas dashed lines comespond to the 
bright sample. (Top): the cross correlation function between radio galaxies, 
quasars, and hya emitters. (Middle): The mean halo mass of Lya emitters 
as a function of distance to the central object. (Bottom): The mean fraction 
of starburst galaxies as a function of distance to the central object. 
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Figure 5. Same as in Fig. 4 but using Hu emitters at z = 2.2. The left panel 
shows results for the fiducial model, whereas the right panels corresponds 
to the case in which AGN feedback has been turned off. 


ies, on the other hand, populate a small subset of the most massive 
haloes, peaking above Mhaio ~ 10'^[Mo//i] . 

The environments of these two types of AGN are mostly dom¬ 
inated by this fundamental difference in their halo mass distribu¬ 
tion. Furthermore, baryonic processes that are important over these 
halo mass ranges can also have an impact on the properties of the 
galaxy population used to trace the environment and overdensities. 

Observations and models have shown that star-forming galax¬ 
ies tend to avoid the centres of massive structures (e.g. Orsi et al. 
2010; Contreras et al. 2013). This is due to the star-formation 
quenching mechanisms that are expected to act in overdense re¬ 
gions. The typical star-formation timescale that is traced by nebular 
emission is of the order ~ lOMyr. Hence, any baryonic process that 
results in an abrupt quenching of star-formation, such as AGN feed¬ 
back, is expected to affect the line luminosities of galaxies, given 
its short timescale. Hence, the properties of emission-line galaxies 
(ELGs) around a massive structure will be related not only to the 
environment itself, but also to the depth (i.e. the limiting flux) of 
the galaxy sample used to trace such environments. Hereafter, we 
study the dependence of a number of properties on environment 
by splitting our ELG sample into "faint", i.e. those galaxies with 
line luminosity L > 10'*'[erg s“' which represent a deep sur¬ 
vey, and "bright", with L > 10''^[erg s“’ h^^], which represents a 
shallow survey. From the observational perspective, these repres¬ 
ent complementary strategies. A deep and small survey can char¬ 
acterise the properties of galaxies within their host halo in a small 
volume, whereas a shallower and wider survey could be used to 
measure statistical properties, such as the galaxy clustering around 
these massive structures. 


3.1 The environment of overdense regions traced by radio 

galaxies and quasars 

We start by characterising the properties of overdense regions 
traced by radio galaxies and quasars. The detection of these over- 
dense regions at high redshifts typically requires long exposures 
and dedicated observations via narrow-band imaging or spectro¬ 
scopic follow-up. It is thus common for observational studies to 
focus on one or only a handful of central objects (e.g. Steidel et al. 
2005; Venemans et al. 2005; Overzier et al. 2006; Venemans et al. 
2007; Kuiper et al. 2011; Husband et al. 2013; Banados et al. 2013; 
Saito et al. 2014; Chiang et al. 2015; Adams et al. 2015). Their in¬ 
terpretation, from a galaxy formation perspective, is, thus, limited 
by cosmic variance and the lack of statistical samples. In addition, 
projection effects can smear out the overdensity significantly, even 
in narrow-band surveys (Chiang et al. 2013). 

In Fig. 3 we illustrate the predicted overdensities of faint Lycr 
emitters around radio galaxies and quasars at z = 3 as a function 
of distance in redshift space. We define the galaxy overdensity as 
Sg(r) = (n(r) - n)/n, where n(r) is the number density of galaxies 
within a sphere of radius r around a central object, and h is the 
galaxy number density averaged over the simulation volume. 

These predictions represent the ideal case in which no pro¬ 
jection effects affect the measured overdensities. Overall, the me¬ 
dian overdensity around radio galaxies is predicted to be higher 
than that around quasars. This is consistent with the correlation 
between halo mass local overdensity. However, the scatter of 6g 
around the median (shown in the plot as the 10 and 90 percentile 
range of the distribution of overdensities around each radio galaxy 
and quasar in the model) is significant. For example, at distances 
below ~ Ifi^'Mpc, the distribution of overdensites around radio 
galaxies and quasars can span several orders of magnitude due to 
the small number of galaxies at these distances. At larger distances 
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from the central object, overdensities of quasars display a larger 
scatter than around radio galaxies. This is likely to be caused by the 
larger range of halo masses that are hosts of quasars, as opposed to 
radio galaxies that are found in a much narrower halo mass range 
(see Fig. 2). 

By studying a sample of overdensites around radio galaxies 
above z > 2, Venemans et al. (2007) reported that 2 out of 8 of 
their radio galaxies have environments that are consistent with be¬ 
ing equivalent to the field. Our model predicts that the fraction of 
radio galaxies hosted in average or underdense environments in¬ 
creases towards lower redshifts and lower host halo masses. Also, 
this fraction is significantly lower when measuring the overdensity 
of galaxies with the faint samples of galaxies instead of the bright 
ones. At z = 2.2, for instance, 20% of radio galaxies hosted by ha¬ 
loes with mass Mhaio ~ 10*^ are in environments with (5gai < 0 of 
bright Lya emitters. This fraction is reduced to ~ 9% when measur¬ 
ing 5gai with faint galaxies. In quasars, these fraction are generally 
higher, reaching up to 35% for the same halo masses. This popu¬ 
lation of underdense radio galaxies and quasars arise partly due to 
cosmic variance and, as shown in Section 4, due to the fact that 
most of the radio galaxies and quasars with dg^i 0 do not evolve 
to become massive clusters, but instead become average haloes at 
z = 0. 

A better way to quantify the clustering is to compute the cross¬ 
correlation function between central quasars or radio galaxies and 
ELGs, This is estimated as 


^cc{r) = 


DD{r) 

Ac«gaiAF(r) 


( 8 ) 


where DD(r) is the total number of galaxies around central objects 
at a distance r ± Ar/2, Nc is the total number of central objects in 
the simulation box, ngai is the mean number density of galaxies, and 
AV{r) is the volume of a spherical shell of radius r and width Ar. 
This width corresponds to the bin size used to compute Eq. 
(8) is suitable for computing the cross-correlation function because 
our simulation box is periodic, and thus, the pair counts are not 
alfected by edge effects. As a result, there is no need to make use 
of estimators that rely on random sets of objects. 

Fig. 4 shows the predicted cross-correlation function between 
radio galaxies and Lyff emitters, and quasars and Lytt emitters at 
three redshifts spanning 2.2 < z < 4.5. The amplitude of is 
higher when using radio galaxies as the central objects than it is for 
quasars for any redshift, as expected. An interesting feature arises 
at small scales (i.e. r < 2/!“*Mpc), where has a higher amplitude 
when computed using the sample of faint Lya emitters than when 
using the bright sample. This might seem counterintuitive at first, 
since we would expect brighter objects to be more clustered than 
faint ones. 

To investigate the origin of this significant difference in the 
amplitude of clustering at small scales we look at the properties 
of the galaxies surrounding radio galaxies and quasars. The mean 
halo mass of Lya emitters as a function of distance is shown in 
the middle panels of Fig. 4. The mean halo mass of Lya emit¬ 
ters around radio galaxies is about 0.5 dex higher than that of 
Lya emitters around quasars. This is a consequence of the hier¬ 
archical growth of structures, in which massive haloes are surroun¬ 
ded by other massive haloes. In both samples surrounding quas¬ 
ars and radio galaxies, faint Lya emitters are hosted by haloes 
that are slightly more massive than brighter ones. At separations of 
r > 10/i^'Mpc, the mean halo mass of both faint and bright galax¬ 
ies tend to converge. This is translated into their cross-correlation 
functions having virtually the same amplitude. 


Lya Ha 
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Figure 6. The mean stellar mass (top), SFR (middle) and gas metalli- 
city (bottom) as a function of distance from radio galaxies and quasar's 
at z = 2.2. Solid curves are computed using environments around radio 
galaxies, whereas dashed curves represent the same around quasars. The 
left column shows the properties of Lya emitters. Blue and red corresponds 
to the fiducial model and a variant without radiative transfer effects, respect¬ 
ively. The right column shows the properties of Ha emitters. Here blue and 
green are the fiducial and a variant with no AGN feedback, respectively. 
In both columns, faint colours correspond to the sample of bright galaxies, 
whereas darker colours con'espond to the faint samples. 


To understand why faint Lya emitters are hosted by more 
massive haloes than bright Lya emitters are, we look at the relation 
between line luminosity and halo mass. Overall, both quantities are 
correlated. However, as shown in Orsi et al. (2012), the correla¬ 
tion differs depending on whether galaxies are forming stars quies¬ 
cently or in starbursts. In this variant of GALFORM, quiescent galax¬ 
ies form stars following a Kennicutt (1983) IMF. Starbursts, on the 
other hand, form stars with a top-heavy-like IMF (see Baugh et al. 
2005). This IMF produces about 4 times more ionising photons 
for each star-formation episode than a “normal” IMF (Le Delliou 
et al. 2005). This implies that lower mass haloes that experience 
a starburst can be brighter in Lya luminosity than a more massive 
halo forming stars quiescently. The bottom panels of Fig. 4 show 
precisely this: the bright sample of Lya emitters consists almost 
entirely of starbursts, whereas the faint sample contains at most 40 
per cent of starbursts. 
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Table 1. Protocluster radius r*[Mpc//t], median completeness/purity C, mean descendant halo mass M(z = 0)[MqIK\ and fraction of cluster descendants 
f(M > 10*'*[Mo//j]) of Lyo- samples around radio galaxies and quasars for different redshifts. 


Redshift 

^niriio 

C'radio 

log M„dio(z = 0) 

/radio(M > 10") 

^qso 

C'qso 

log Mqso(z = 0) 

/,,o(M > 10'®) 

5.7 

9.37 

0.88 

14.59 

0.76 

6.88 

0.73 

14.07 

0.34 

4.5 

6.97 

0.85 

14.48 

0.69 

6.04 

0.69 

14.01 

0.29 

3.0 

5.77 

0.84 

14.34 

0.58 

5.08 

0.61 

13.91 

0.23 

2.2 

5.12 

0.78 

14.26 

0.50 

5.00 

0.50 

13.83 

0.19 


The difference in the fraction of starbursts between faint and 
bright samples of Lytt emitters is also enhanced by the Lya radi¬ 
ative transfer model, which assigns low escape fractions to quies¬ 
cent galaxies, and higher ones to starbursts, as shown in Orsi et al. 
(2012). Hence, the resulting cross-correlation functions shown in 
Fig. 4 are the result of a combination of: AGN modeling (which res¬ 
ults in radio galaxies populating more massive haloes than quasars), 
the hierarchical clustering of haloes (which leads to more massive 
haloes surrounding radio galaxies than quasars), the choice of the 
IMF for quiescent galaxies and starbursts (which is responsible for 
most of the bright Lya emitters in lower mass haloes than quiescent 
Lya emitters), and finally the Lya radiative transfer model favour¬ 
ing the escape of Lya photons in starbursts. 

Unlike Lya emitters, Ha emitters are not subject to complex 
radiative transfer due to resonant scattering, and their attenuation 
by dust is smaller, making these galaxies excellent tracers of the 
instantaneous SFR (Kennicutt 1998; Calzetti 2013). Apart from the 
radiative transfer effects, however, Ha emitters are essentially equi¬ 
valent to Lya emitters in nature. Hence, by comparing the proper¬ 
ties of the same environments traced by Lya and Ha emitters we 
can obtain a better picture of how environmental processes affect 
the galaxy properties. 

We show the cross-correlation functions between radio galax¬ 
ies, quasars and samples of faint and bright Ha emitters at z = 2.2 
in Fig. 5. As with Lya emitters, the cross-correlation function in¬ 
volving radio galaxies has a higher amplitude than that with quas¬ 
ars as central objects. However, unlike with Lya emitters, there is 
little difference between the correlation functions when using the 
bright and faint samples of Ha emitters, even at small scales. The 
middle panels of Fig. 5 show that the mean halo mass for the two 
samples of galaxies is also very similar. Also, the fraction of star- 
bursts, shown by the bottom panels of Fig. 5 shows that there is a 
similar fraction of starbursts at small scales, although the fraction 
of starbursts increases towards larger scales in the bright sample 
of Ha emitters. At large scales, there is only a weak dependence 
between Ha luminosity and clustering amplitude, measured by the 
bias factor b (Orsi et al. 2008). 

In order to understand what is causing galaxies in the faint 
and bright samplea of Ha emitters to have similar halo masses 
and starburst fractions we explore the effect of an environmental 
process that plays a role only on small scales. Since central ob¬ 
jects are active galaxies, we run a variant of the GALFORM model in 
which AGN feedback is switched off, leaving everything else the 
same. Interestingly, this variant of the model mimics the suppres¬ 
sion of the clustering amplitude on small scales that is also evid¬ 
ent in the Lya emitter samples, although in this case the difference 
between the amplitude of the cross-correlations involving faint and 
bright samples is much stronger when the central objects are radio 
galaxies. This is consistent with the expectation that AGN feedback 
plays a stronger role in the environments of radio galaxies than in 
quasars. In the absence of AGN feedback there are two notice¬ 
able changes in the properties of galaxies. On one hand, galaxies in 
massive haloes that would be quenched by AGN feedback are now 


part of both the faint and the bright samples of Ha emitters, thus 
increasing the average halo mass of both populations. On the other 
hand, the bulk of the quiescent galaxies do not reach luminosities 
above 10"^^[erg s“' so the bright sample of galaxies consists 
almost entirely of starburts. Those galaxies, as discussed earlier, 
span a larger range of halo mass because of their higher production 
of ionising photons. Therefore, the bright sample has, on average, 
smaller halo masses than the average fainter ones. As a result, at 
small scales, the mean halo mass of faint Ha emitters is higher 
than that of bright ones, resulting in a significant difference in their 
clustering amplitude." 

The analysis above shows that although the properties of 
galaxies in protoclusters are fundamentally determined by the prop¬ 
erties of their host haloes, the role of baryonic effects on the prop¬ 
erties of galaxies in overdense regions is also predicted to be im¬ 
portant. 

To gain deeper insight into the effect of environment on galaxy 
properties we compute how other galaxy properties change as a 
function of distance from their central objects. In particular, we fo¬ 
cus on the stellar mass Mstdiar, the star-formation rate SFR and the 
cold gas metallicity Zj-ou. Fig. 6 shows the mean values of these 
properties as a function of distance from their central objects for 
both Lya and Ha emitters at z = 2.2. We also compute these quant¬ 
ities using two variants of our fiducial model, one in which there is 
no Lya radiative transfer, and another in which AGN feedback is 
turned off. 

The mean stellar mass of Lya emitters in the field is predicted 
to be about 10® ®[Me/fi], and increases about 0.2 dex in the inner re¬ 
gions of the overdense regions. The stellar mass of galaxies in radio 
galaxy or quasar environments is predicted to be indistinguishable, 
even when comparing faint and bright samples. When Lya radiat¬ 
ive transfer is switched off, the stellar mass is predicted to be about 
0.6 dex higher than in the previous case. For Ha emitters, bright 
galaxies are more massive than faint ones, and the effect of the en¬ 
vironment is more important. When AGN feedback is off, galaxies 
can grow in stellar mass reaching ~ 10**’*[Mo/fi] for small dis¬ 
tances in bright Ha emitters. 

The SFR of faint Lya emitters tends to decrease by up to about 
0.4 dex in overdense environments. Bright Lya emitters, as expec¬ 
ted, have higher SFRs, but they display no environmental depend¬ 
ence. When radiative transfer is disabled, the environmental effect 
on the faint sample is erased, and there is a slight increase of the 
SFR in bright Lya emitters. Ha emitters, on the other hand, also 
present a very small decrease of their SFR in overdense environ¬ 
ments. Interestingly, the variants with AGN feedback turned off 
also present a small decrease in the SFR. This occurs because in 
GALFORM when a galaxy becomes a satellite its hot halo is stripped. 
Such environmental effect reduces the total reservoir of gas to form 
stars, thus producing a quenching of star formation. Recently, Peng 
et al. (2015) found evidence that this effect is a primary mechanism 
for quenching star formation in a sample of local galaxies. 

Finally, gas metallicities tend to increase as a function of en¬ 
vironment density for both Lya and Ha emitters. Interestingly, our 
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Figure 7. The protocluster completeness (red) and purity (blue) of the 
sample of faint Lya emitters around radio galaxies at z = 3.0. The solid 
lines show the median completeness and purity. The shaded regions show 
the 10-90 percentile of the distribution of completeness and purity of Lyn 
emitters at a given distance from a central radio galaxy. The vertical dashed 
line corresponds to the distance at which the completeness and purity are 
equal. 

model predicts that faint Lya emitters should have a higher gas 
metallicity than bright ones, but the opposite is true for Ha emit¬ 
ter samples. This is due to Lya radiative transfer effects. When 
AGN feedback is off, galaxies are predicted to have much higher 
gas metallicities. This is a natural consequence of them having also 
larger stellar masses and SFRs. 

Despite the difference in halo masses hosting radio galaxies 
and quasars, and AGN feedback being less common in quasars 
than radio galaxies, there is very little difference in the environ¬ 
mental dependence of the properties studied here between the two 
tracers. AGN feedback acts to shut down the SFR, thereby prevent¬ 
ing the formation of more massive galaxies and also the chemical 
enrichment of their gas component. Lya radiative transfer favours 
the escape of Lya photons in galaxies with lower gas metallicities, 
stellar masses and SFRs than is the case for Ha emitters. 

3.2 Protoclusters and their descendants 

We now study how the overdense regions discussed before are 
related to progenitors of massive structures at the present day. 
GALFORM can track the progenitors of any present-day halo by re¬ 
trieving its merger history. This is useful for determining which 
galaxies surrounding a central object at high redshift actually form 
part of a massive cluster at late times. We determine the progen¬ 
itors associated to a protocluster-like structure as those galaxies 
that surround a central object (radio galaxy or quasar) that at z = 0 
are within the same (Friends-of-friends) massive DM halo. Fur¬ 
thermore, we identify protoclusters as those massive structures that 
become part of a DM halo with mass Mhaio > lO'^^fMo/A]. As we 
discuss below, a significant fraction of radio galaxy and quasar en¬ 
vironments do not form a massive cluster by z = 0. Nevertheless, 
in the following we apply the same analysis to all environments 
around these central objects. 

A practical problem in observations is to determine the size of 


a protocluster. Since we can identify the progenitors of a massive 
halo, here we devise a method to determine the average size of 
overdensities around both types of central objects. We take the 
simple approach of considering protocluster members as those 
galaxies that lie within a sphere of a given radius around the cent¬ 
ral object. If that radius coincides with the most distant progenitor, 
then the sphere will contain all true progenitors, but also a signific¬ 
ant fraction of galaxies that are not progenitors of the same halo. 
Hence, to find an appropriate radius, we measure the completeness 
of a protocluster at a radius r, defined as the fraction of progenit¬ 
ors enclosed within that radius r. Likewise, we measure the purity 
of the sample at a radius r, defined as the ratio of the number of 
galaxies that are progenitors to the total number of galaxies en¬ 
closed within r. In the following, for simplicity, we will restrict the 
analysis of overdensities and progenitors to those determined using 
the sample of faint Lya emitters. 

Fig. 7 illustrates this by showing the distribution of complete¬ 
ness and purity as a function of distance from radio galaxies at 
z = 3. As expected, at small distances from the central objects the 
purity is at its highest, and the completeness is low. At a given ra¬ 
dius r*, both quantities intersect, typically at high values. We call 
r* the typical protocluster radius at that redshift, and C the corres¬ 
ponding completeness and purity at that radius. 

Table 1 shows r" and the completeness and purity at that ra¬ 
dius for radio galaxies and quasars at different redshifts. Overall, r* 
decreases towards lower redshifts. For radio galaxies, for instance, 
r* goes from 9.37 to 5.12 /i^'Mpc for redshifts z = 5.7 to z = 2.2. 
Radio galaxies at lower redshifts are, on average, hosted by more 
massive haloes than those at higher redshifts, and this is effectively 
translated into larger structures (in physical coordinates) at low red¬ 
shifts than at high redshifts. However, this also implies a decrease 
of the radius of protoclusters towards lower redshifts when sizes 
are expressed in comoving units. 

Also, the size of the proto-structures traced by radio galaxies 
have completeness and purities around ~ 0.8 at all redshifts. Quas¬ 
ars, on the other hand, have lower values of completeness and pur¬ 
ity, ranging from 0.5 at z = 2.2 to 0.7 at z = 5.7. This suggests that 
the r* values shown in Table 1 are reasonable for radio galaxies, but 
do a poorer job of representing the size of proto-structures traced 
by quasars. This is likely to be related to the large range of halo 
masses that host quasars, which is translated into a large variety of 
sizes and environments traced by quasars, unlike radio galaxies that 
are significantly more restricted in host halo mass (see Fig. 2). This 
also results in our model predicting radio galaxies to trace more 
massive descendant halo masses compared to quasar descendants. 
The fraction of protoclusters (i.e. those structures with z = 0 halo 
mass Miiaio > lO'^^fMa/A]) traced by radio galaxies and quasars dif¬ 
fers significantly. At z = 2.2, half of radio galaxies are predicted to 
trace protoclusters. On the other hand, only 19% of quasars pin¬ 
point the location of protoclusters at this redshift. These fractions 
grow considerably towards higher redshifts, since both radio galax¬ 
ies and quasars trace increasingly higher and rarer peaks of the mat¬ 
ter density distribution of the Universe towards high redshifts. 

Since the overdensity of galaxies is a biased tracer of the mat¬ 
ter content, we explore how the overdensitiy within protoclusters 
correlate with their descendant z = 0 halo mass. To do this we com¬ 
pare the overdensity of galaxies measured within r* with the z = 0 
descendant mass for protoclusters traced by radio galaxies and by 
quasars. This relation is shown in Fig. 8. Overall, protoclusters 
traced by radio galaxies become haloes at z = 0 with masses ran¬ 
ging over 10*^'^ < Mjj^“[Mo//r] < 10*^. Quasar descendants also 
reach those high halo masses, but a significant fraction have masses 


MNRAS to be submitted, 000-000 (0000) 









10 


A. Orsi et al. 



Figure 8. The relation between protocluster galaxy overdensity, 6g, measured with the faint sample of Lya emitters within r* and their z = 0 descendant halo 
mass ■ The left panels show this relation for protoclusters traced by radio galaxies at different redshifts, and the right panels with quasars. Each panel 
con'espond to a different protocluster redshift, ranging from z = 4.5 down to z = 2.2 as labelled. 


below 10'^[Mo//t] (for a discussion on quasar descendants see Fan- 
idakis et al. 2013). The overdensity of galaxies within r" correlates 
well with the descendant halo masses for both radio galaxies and 
quasar descendants. This shows that our choice of radius r* to de¬ 
termine the average extent of protoclusters at a given redshift is a 
resonable one. 

The correlation between the galaxy overdensity and the halo 
descendant mass shown in Fig. 8 can be described with a simple 
linear form: 

log(M;=«[Mo/h]) = 13.07 -f 0.11(1 -f z) -f 0.12<Sga,(r*), (9) 
log(M5'J,[Mo/h]) = 12.78 -f 0.10(1 -b z) -H 0.16<Sga,(r*),(10) 

where is the descendant halo mass in units of [Mg/A]. Al¬ 
though the constants in Eqs. (9) and (10) have similar values, their 
difference reflects the smaller descendant masses of quasars with 
respect to radio galaxies. Also, the scatter of the descendant mass 
as a function of is larger in quasars than in radio galaxies, re¬ 
flecting the broader diversity of environments that are traced by 
quasars. Despite the apparent broad scatter around the scaling re¬ 
lations, we have checked that, for radio galaxies, 68 percent of the 
distribution is within 0.2 dex of the relation given by Eq. (9). Eor 
quasars, this remains the same except at z = 2.2, where 68 percent 
of the distribution is within 0.6 dex of the scaling relation of Eq. 
( 10 ). 

Observational samples could make use of Eqs. (9) and (10) 
using r* predicted by GALFORM to obtain an approximate value 
of the z = 0 descendant that a protocluster is expected to evolve 
into. The scaling relations obtained when using the bright sample 
of Lyff emitters is consistent with the ones obtained for the faint 
sample within the 1 - <t region, making Eqs. (9) and (10) suitable 
for samples of galaxies limited by these two luminosity ranges. 

The above analysis results in simple scaling relations that 


could be easily applied to observational data. However, observa¬ 
tions generally give projected measurements of the overdensity of 
galaxies, and these are expected to differ significantly with respect 
to the spherically averaged overdensities (e.g. Muldrew et al. 2012; 
Shallow et al. 2013; Chiang et al. 2013). We can use our model 
to determine what overdensity values typical instruments would 
measure at different redshits. To do this, we project positions along 
the redshift space s coordinate (see Eq. 7), and convert the spectral 
resolution of an instrument into a comoving distance probed along 
the line-of-sight. We then compute the overdensity around a given 
protocluster by using a cylinder of radius r" and depth given by 
the spectral resolution of the instrument. In the case of very high 
resolution instruments, r* can be larger than the comoving distance 
range probed by the spectral resolution. Such cylinders would then 
probe only a fraction of the galaxies in a protocluster overdens¬ 
ity. To include all galaxies within the protocluster radius, we stack 
several cylinders of equal depth and decreasing radius around both 
sides of a central cylinder of radius r* to probe a volume that ap¬ 
proximates that of a sphere of the same radius. 

Fig. 9 shows a direct comparison between the overdensity 
computed within a sphere, and that within cylinders by pro¬ 
jecting distances along the line-of-sight, both measured at r* 
at their respective redshifts according to Table 1 . To illustrate the 
effect of different spectral resolutions, we choose four instruments, 
one for each redshift spanning 5.7 < z < 2.2. 

MOONS (Cirasuolo et al. 2014) is a future multi-object spec¬ 
trograph proposed for the VLT that, given its spectral coverage, can 
be used to find Lya emitters at z = 5.7 and beyond. Its wavelength 
resolution, AT = 0.9 A at the Lya observed wavelength, trans¬ 
lates into a line-of-sight comoving resolution at z = 5.7 of Ar^ = 
0.24/i“‘Mpc. Fig. 9 shows that our model predicts a remarkably 
accurate projected overdensity compared to the real, spherical one. 
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Similarly, WEAVE (Dalton et al. 2012), another future multi-object 
spectrograph at the 4.2 meter William Herschel Telescope on La 
Palma can be used to search for Lya emitters at z = 4.5. Its spectral 
resolution at the Lya wavelength translates into a comoving res¬ 
olution of Ar, = 0.12/r^‘Mpc. This allows to obtain very accurate 
projected overdensities, similarly to MOONS at z = 5.7. 

At lower redshifts, a given spectral resolution is translated into 
larger comoving distance ranges, meaning larger projection effects. 
At z = 3.0, for instance, we predict the projected overdensities that 
could be obtained with MUSE, a panoramic integral field unit (lEU) 
mounted on VLT (Bacon et al. 2010). Its wavelength resolution at 
z = 3.0 translates into Ar^ = L33/i“*Mpc. The median value of 
falls very close to the exact, spherical overdensity although 
the scatter around is larger than in the previous cases due to the 
larger comoving distance resolution. Finally, for z = 2.2 we illus¬ 
trate the results of using VIRUS, an array of lEUs mounted on the 
Hobby-Eberly Telescope to carry our the HETDEX cosmological 
survey (Hill et al. 2008). The spectral resolution for Lya emitters at 
z = 2.2 translates into a comoving resolution of Ar^ = 5.08/i^'Mpc, 
which is comparable to the typical size of protoclusters at this red- 
shift. The scatter is, as expected, larger than in the previous cases. 
It is worth noting that both VIRUS and MUSE can obtain projected 
overdensities with much better accuracies for higher redshifts than 
shown here. 

On the other hand, narrow band filters with a typical width of 
~ lOOA can deliver projected overdensities with resolutions along 
the line of sight of about 50 - lOO/j^'Mpc over the redshift range 
shown here (e.g. Venemans et al. 2007; Kuiper et al. 2011; Saito 
et al. 2014). Therefore, such projected overdensities are not suit¬ 
able for characterising the scales of protoclusters as shown here. 
However, given the wider areas that are accesible with a photomet¬ 
ric survey, such studies could characterise the clustering around ra¬ 
dio galaxies and quasars over scales beyond the protocluster typical 
radii to put constraints on the DM halo masses hosting these two 
central objects. 


4 CONCLUSIONS 

In this paper we have presented predictions for the properties of 
overdense regions around radio galaxies and quasars at high red¬ 
shifts using a model that includes a physical treatment for quasars, 
radio galaxies and star-forming emission-line galaxies. 

Most of our results here follow from a key prediction of our 
model, namely that, at a given redshift, radio galaxies are hosted 
on average by DM haloes that are significantly more massive than 
quasars. Eurthermore, most quasars are hosted by DM haloes of 
masses ~ 10'^[Mo/h] almost irrespective of redshift, whereas radio 
galaxies populate the most massive haloes present at any redshift. 
This crucial distinction is translated into their clustering properties 
being different, and the impact of AGN feedback being stronger in 
radio galaxy environments. 

This paper discusses two problems. First, we study the prop¬ 
erties of the overdense regions traced by radio galaxies and quasars 
at z > 2, measuring the environment with samples of emission¬ 
line galaxies and exploring how the physics of baryons impacts the 
galaxies in these environments at high redshift. Second, we identify 
the progenitors of the descendant halo that is traced by radio galax¬ 
ies and quasars to define protoclusters and link their properties to 
their descendant haloes. 

The analysis of individual overdense regions at high redshifts 
is subject to significant cosmic variance that limits their interpreta- 
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Figure 9. The overdensity of protoclusters traced by radio galaxies and 
faint Lya emitters at different redshifts. The panels compare the overdensity 
measured within a sphere, <5^° and that measured with projected distances 
along the line of sight within a cylinder, The distance probed along the 
line of sight varies depending on the spectral resolution of the instrument 
simulated, shown in the legend of each panel. 


lion in terms of general galaxy formation physics. This explains, for 
instance, why some authors have found peculiar environments that 
are average or underdense around both radio galaxies and quasars 
(Venemans et al. 2007; Husband et al. 2013; Banados et al. 2013). 

The cross-correlation functions between overdensity 
tracers (radio galaxies and quasars) and emission-line galaxies at 
high redshifts offer different information on small and large scales. 
At large scales (r > lOh^'Mpc) the amplitude of ^cc is larger when 
the central objects are radio galaxies, because these are hosted by 
more massive haloes. More interestingly, if we split the sample of 
emission-line galaxies into faint and bright galaxies, we find that 
the clustering on small scales is very different for both. For Lya 
emitters, radiative transfer effects and the higher production rate of 
ionising photons in starbursts invoked in GALFORM makes faint Lya 
emitters have a higher ^cc than bright ones. In the case of Ha emit¬ 
ters, which are not affected by radiative transfer effects, the differ¬ 
ence between faint and bright samples is insignificant. In this case, 
we find that AGN feedback prevents starbursts from dominating 
the galaxy abundance at small separations. This effect is stronger 
in radio galaxies. 

The next generation of large redshift surveys will be able to 
characterise the environments of overdense regions with unpreced¬ 
ented detail. In particular, a number of these will rely on emission¬ 
line galaxies to map the matter distribution. Spectroscopic surveys 
such as HETDEX (Hill et al. 2008) and DESI (Levi et al. 2013) 
have sufficient spectral resolution to probe the scales on which our 
model predicts that baryonic effects are noticeable on small scales. 
Other multi narrow-band surveys such as J-PAS (Benitez et al. 
2014) and PAU (Castander et al. 2012) will provide large samples 
at z ^ 2, and could measure the amplitude of on larger scales, 
thus allowing tight constraints to put on the halo masses hosting 
radio galaxies and quasars. 

Lya radiative transfer and AGN feedback also have an impact 
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on the physical properties of emission-line galaxies in overdense re¬ 
gions. Overall, the stellar mass, star-formation rate and gas metalli- 
city of Lya emitters have a small environmental dependence, which 
is stronger in the absence of Lya radiative transfer. AGN feedback 
produces average values of these properties in overdense regions 
that are different in radio galaxies than quasars, but the difference 
is too small to be probed observationally. 

The progenitors of a present-day cluster allow us to define in 
the model an average protocluster radius r* in overdensities traced 
by radio galaxies and quasars, by determining the distance at which 
the average completeness and purity of the predicted protoclusters 
coincide. This simple definition allows us to correlate the galaxy 
overdensity within the protoclusters with the halo descendant mass 
at z = 0. We compute scaling relations between these two quantities 
that should provide with a good approximation the halo descendant 
mass of observed protoclusters at high redshifts. 

Current and planned high resolution multi object spectro¬ 
graphs and IFUs are predicted to be able to measure projected over¬ 
densities that are very close to the ones computed in 3D, thus mak¬ 
ing it feasible to explore the predictions of this work with regard to 
observational data. 

The current generation of instrumentation with resolutions 
above R ~ 3000 should be able to resolve the inner structure of pro¬ 
toclusters, where the strongest departures of the galaxy properties 
with respect to the field are predicted to occur. On the other hand, 
wide-area narrow-band surveys should be able to provide large stat¬ 
istical samples to probe the clustering properties of protoclusters 
traced by quasars and radio galaxies. 
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